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SECTION 1 


INTRODUCTION 

9 

This; report summarizes our research activities in the first half of 
1984. During this period issues pertaining to the well-posedness of a 
two time scale Approach to the output feedback regulator design problem 
have been examined. An approximate quadratic performance index which 
reflects a two time scale decomposition of the system dynamics was 
developed. It is shown that, under mild assumptions, minimization of 
this cost leads to feedback gains providing a second-order approximation 
of optimal full system performance. 

A sequential numerical algorithm was defined which obtains output 
feedback gains minimizing a broad class of performance indices, 
including the standard LQ case. We have proven that the algorithm 
converges to a local minimum under nonrestrictive assumptions. This 
procedure was adopted to, and demonstrated for the two time scale 
formulation. As an additional demonstration of the breadth of the class 
of performance indices minimized by this algorithm, a procedure for 
optimally zeroing selected gain elements in an output feedback gain 
matrix was developed and demonstrated. 

This report summarizes the main theoretical results for this period. 
A contractor's report is currently being prepared that will detail all 
the major developments, and will provide computational algorithms and 
numerical results that substantiate the theoretical results. A summary 
of conference and journal publications that have resulted from this 
research is provided at the end. 


SECTION 2 


OPTIMAL OUTPUT FEEDBACK DESIGN 

In this section, the optimal output feedback problem is formulated for 
a class of problems which includes the standard LQ case. A convergent 
sequential numerical algorithm for solving the necessary conditions for 
optimality is described. Because the algorithm provides a sequence of 
monotoni cally improving gains, the solution obtained at convergence is 
locally optimal. 

2.1 Problem Formulation and Necessary Conditions for Optimality 


We consider systems of the form 

x = Ax + Bu x(0) = x 0 (2.1) 

where x e R n and u s R m , with output 

y = Cx (2.2) 

where y e R R . The control has the form 

u = -Gy (2.3) 

The gain G is to be chosen to minimize 

J = f o xT Q x ' h u T Ru dt + y(G) (2.4) 


where Q = r T r such that the pair (r,A) is detectable, and R > 0. In 
addition, it will be seen that, in order to avoid singularity in the 
necessary conditions for optimization problem, we must have 

P(C) = p (2.5) 
In (2.4), y(G).is any scalar function having a continuous gradient in G, 
and for which J is bounded below, for all G which render the closed loop 
dynamics (2. 1-2. 3) asymptotically stable. 


It is well known that the integral portion of J satisfies the 
relation 

/oX T Qx + u t Ru dt 3 tr{Kx 0 x 0 T } (2.6) 

where K ) 0 is the unique solution of 

S(G,K) - A t K + KA + Q + C t G t RGC » 0 (2.7) 

A - A - BGC • (2.8) 

and A is asymptotically stable. It is customary to relieve (2.6) of its 
dependence on x 0 by assuming that it is uniformly distributed on the unit 
sphere; then the problem statement is modified slightly to that of 
minimizing E{J}. This amounts to replacing x 0 x 0 T in (2.6) by I. 

The minimization of (2.4) is now cast, as a static optimization 
problem, in which the Lagrangian 

L (G,K,L) - tr{K) + y(G) + tr{S(G,K)L T ) (2.9) 

is minimized with respect to G, K and L, where L is a matrix of Lagrange 
multipliers. If the system (2. 1-2.3) can be stabilized by output 
feedback, the first order necessary conditions for optimality are 


Ur 

3G 


* 


= 0 




( 2 . 10 ) 


where the * ’ s mean that the gradients are evaluated at the optimal values 
of G, K and L. In the sequel, the * notation is suppressed since the 
gradients are assumed evaluated at their optimal values unless specified 
otherwise. Defining the gradient of y(G ) 


3V(G) 

9G 


= yg(g) 


( 2 . 11 ) 


the expansion of (2.10) is 

RGCLC T - B T KLC T + ^(g) = 0 


( 2 . 12 ) 
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AL + LA T + I « 0 

(2.13) 

S(G,K) - 0 

(2.14) 

From (2.12), the optimal value of G will satisfy 
G* * R" 1 [B t KLC t - yg( g >]<clc t )“ 1 

(2.15) 


where (CLC T )"'- 1 - exists because of (2.5) and the fact that L > 0 in (2.13). 
2.2 A Convergent Numerical Algorithm 

The following algorithm suggests itself for solving (2.12-2.14): 

0. Choose any G such that A is Hurwitzian. Set i * 0. 

1. Solve (2.13,2.14) for and L^. 

2. On the basis of (2.15), evaluate 

AGi = R-^BTKiLiCT - ^(G*) J (CLfC 1 )" 1 - G ± (2.16) 

3 . Set 

G^-f i — G^ + aAGj> (2.17) 

where a e (0,1] is chosen to ensure that 
Ji+1 < Ji “ tr{Ki} y(Gi) (2.18) 

4. Set i “ i + 1 and go to 1. 

This is a very simple procedure to implement, since it only involves the 
solution of two Lyapunov equations. The unfortunate necessity of 
supplying an initial stabilizing gain for step 0 is shared by other 
sequential algorithms currently available. 

The following theorem has been proven: 

Theorem 2.1 : For the optimal output feedback problem defined in 

(2. 1-2.4), let the following conditions be satisfied: 

i) G = {G : A is Hurwitzian) t 

ii) p{C) = p 

iii) Q = r T r such that (T,A) is detectable; R > 0 

iv) y(G) is C* for all G e G 
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v) If y(G) + -«• for all II G e G II + °°, then it does so in such a 

way that |y(G) | / tr{ K} < 1 

If (i-v) are true, then the sequence {Gj : 1 * 0, 1 , . . . } of 

stabilizing gains defined by (17) exists for any G 0 e G , such that 
(2.18) is satisfied at each iteration. Moreover, the sequence 
converges to a stationary point in J. 

Note that (i-iii) are the standard conditions required for solving 
the LQ optimal output feedback problem. Loosely speaking, (v) means 
that, in choosing y(G), one must be certain that it does not become 
negatively unbounded at a faster rate than tr{K} becomes positively 
unbounded for II Gil + 00 • Recall that, because of (Iv), y(G) cannot assume 
unbounded values for finite G. It should also be noted that, while the 
theorem does not rule out the theoretical possibility of convergence to a 
saddle point in J, encountering- a saddle in practice would only slow the 
convergence to a local minimum of J, since the saddle point would be 
unstable in G. 
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SECTION 3 


SPT IN OUTPUT FEEDBACK 

In this section SPT is employed to decompose an ill-conditioned 
closed-loop output feedback system into its slow and fast subsystems. 

In the process of doing so, we gain some insight into the well-posedness 
of the SPT-approximate design problem. 

3.1 Problem Formulation 


Consider the system 


XI ■ Anxi + A 12 X 2 + Biu 

*1(0) 

« XI 0 

xi e R n l 

(3.1) 

ex 2 " A21X1 + A 22 X 2 + B2U 

x 2 (0) 

" *20 

X2 e R n ^ 

(3.2) 

where 0 < e « 1 , with output 





y “ Cl XI + C 2 X 2 



yeRP 

(3.3) 

The feedback law is 





u - -Gy 

f -9 * 


ueR m 

(3.4) 


If A 22 is invertible, a reduced order approximation of (3. 1-3.3) can be 
obtained by setting e = 0 in (3.2): 


5 = A 0 + B 0 u £eR ni (3.5) 

y = C 0 K + D 0 U (3.6) 

where 

A 0 = An - M 2 A 22 A 2 I B 0 = B X - A 12 A 22 B 2 

C 0 = Ci - C 2 A 22 A 21 D 0 = -C 2 A 22 B 2 (3.7) 

Substituting (3.4) in (3. 1,3. 2) and setting e = 0, the reduced feedback 
control is expressed as 

u = -G° C 0 ? , (3.8) 

G° = (I + GDq) - ^ (3.9) 

which necessitates the assumption 

p(I + GD 0 ) = m (3.10) 


6 


The inverse of (9) is 

G - G°(I - D 0 G°)“1 (3.11) 

The following lemma was proven, which states that satisfaction of the 
invertibility conditions for (3.9) and (3.11) is simultaneous, and that 
this guarantees local one-to-one correspondence between G° and G. 

Lemma 3.1 : 

p(I - D 0 G°) « p iff p(I + GD 0 ) » m; 

furthermore, these conditions are necessary and sufficient for G° 
and G to be locally one-to-one. 

The next lemma was proven, which assures that (3.10) will hold for any G 
not rendering the fast closed-loop system singular. 

Lemma 3,2 : Given that A 22 is nonsingular, 

p(I + GD 0 ) = m iff p(A22 “ B 2 GC 2 ) = n2 
In summary, Lemmas 3.1 and 3?2 assure that the Inverses in 
(3.9,3.11) exist for any realistic design problem. Indeed, if 
A 22 “ B 2 GC 2 were singular, the fast subsystem dynamics would not be 
"fast". It should be noted that if (3.9) and (3.11) did not define a 
unique correspondence between G° and G, reduced order approximations 
would have very little utility in output feedback design. 

3.2 Asymptotic Properties 

The closed-loop system matrix for (3. 1-3.4) takes the form 
a An “ B1GC1 A12 ” B1GC2 

A = (3.12) 

(A 21 “ B 2 GC 1 ) /s (A 22 “ B 2 GC 2 )/ e 

Next, construct an invertible transformation which block 
diagonalizes A: 

X 1 

= T(e) (3. 13. a) 
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T(e) 


(3. 13.b) 



*■ 

* 


* 

eH 


I-eHN 

-ell 

T -1 (e) * 

I 

T(e) » 

N 

I 

-N 

I-eNH 



- 

m 


0 



In (3.13), K Is exclusively the slowly varying portion of the closed loop 
state and n is the fast transient. After some algebra, It can be shown 


that 

N(e) - A22(A21 “ B 2 G°C 0 ) 

+ e(I+Ai 2 B 2 G 0 C 2 )Aj 2 (A 21 ~B 2 GOC 0 )(A 0 -B 0 GOC 0 ) + 0(e 2 ) (3.14) 

H(e) - (A i2 - B 0 G°C 2 )A22 + 0(e) (3.15) 

These expressions can easily be verified if one recalls the definitions 

in (3.7) and uses the fact that, if A 22 exists, 

(A 22 “ B 2 GC 2)" 1 “ (I + A22B2G°C2)A22 (3.16) 

Using (3.13) in (3.12), the/ dynamics are decoupled: 

l = [ (A o “B g G°C o )+0( e ) ] £ ?(0) = x 10 (3.17) 

ep = [(A22“B2GC2)+0(e)]n n(0) “ X2Q”A22(A2i“B2G°C 0 )xio+0(e) (3.18) 

so that, for e sufficiently small, 

S(t) = exp[(A 0 - B o G o C o )t]S(0) + 0(e) (3.19) 

p(t) = exp((A22 ” B2GC2)t/e]ri(0) + 0(e) (3.20) 

Employing T“^(e) from (3.13) to transform back to xj , X2, we obtain 

xx(t) = ?(t) + 0(e) (3.21) 

X 2 (t) = “A 22 ^ A 2 l “ B 2 G°C 0 )?(t) + n(t) + 0(e) (3.22) 

Similarly, T”^(e) transforms u as defined by (3. 3, 3. 4): 

u(t) = ~G°C 0 C(t) - GC 2 n(t) + 0(e) (3.23) 

This development is summarized in the following theorem: 

Theorem 3.1 : If A 22 “ B 2 GC 2 is Hurwltzian, then (3.21-3.23) 


describe the full order system and control trajectories for all 
finite t > 0. Additionally, if A 0 - B 0 G°C 0 is Hurwltzian, then 
(3.21-3.23) are true for all t > 0. 
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An immediate (and crucial) consequence of this theorem is that, for 
sufficiently small e, output feedback stabilizability of the full system 
(3. 1-3,4) is equivalent to joint output feedback stabilizability of both 
subsystems. Note that the output feedback problem does not naturally 
decompose into separate slow and fast designs. Instead, G° 
and G must stabilize the separate systems (3.17,3.18) while satisfying 
the hard constraint (3.9). 


SECTION 4 


NEAR-OPTIMAL OUTPUT FEEDBACK REGULATION 
In this section, for the ill-conditioned system dynamics of Section 
3, the block diagonalizing transformation T(e) from (3.13) is applied to 
the quadratic performance criterion of Section 2. If the slow subsystem 
measurements are nonredundant Minimizing the transformed criterion at e * 
0 results in a gain solution which yields a second order approximation to 
optimal full system performance, while eliminating the dimensionality and 
ill conditioning difficulties of minimizing directly for the full system 
dynamics. 

4.1 Definition of the Approximate Problem 

The performance index for the full order system (3. 1-3.4) is 


J “ / o ^ X T » x 2 3 Q 


X 1 

x 2 


+ u^Ru dt 


(4.1) 


where R > 0 and Q “ r T r such that (T,A) is detectable. Q is compatibly 
partitioned as 


Q = 


Ql 

T 

Q2 

Q2 

Q3 


(4.2) 


Assuming that the closed-loop system matrix A in (3,12) is asymptotically 


stable, then (4.1) is equivalent to 


J = tr{Kx 0 x 0 ) 

> is f x 10* x 20] 

A t K + KA' + Q » 0 


T T T 

where x 0 is [xio,x 2 o], and R > 0 is the unique solution of 


K = 


Q » 


'Ki gk| 
sK 2 eK 3 J 
Ql + cfG T RGC]. 
Q 2 + C 2 G t RGC! 


Q 2 + CiG t RGC 2 
Q3 C 2 G t RGC 2 


(4.3) 

(4.4) 

(4.5) 

(4.6) 
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The problem of minimizing (4.3) with respect to G can be decomposed by 
using T-J-Ce) from (3.13) to transform the coordinates from xj,X 2 to 5 and 
r\. After transformation, (4,4) decouples into; 


Sl(G 0 ,Ki,e) - A Jki + KlA 0 + Qi " 0 

(4.7) 

A? 2^2 + ^ 2^0 + Q 2 “ 0 

(4.8) 

S 3 (G,K 3 ,e) - A 22 K 3 + K 3 A 22 + Q 3 - 0 

(4.9) 

A 0 - A 0 - B 0 G°Cp + 0(e) 

(4.10) 

a 22 * A 22 “ B 2 GG 2 + 0 (e) 

(4.11) 

T 

Ql - Qi - N t Q 2 - Q 2 N + N T Q 3 N + Co’g°RG°C 0 + 0(c) 

(4.12) 

Q 2 - Q 2 “ O 3 N + C 2 G t RG°C 0 + 0(e) 

(4,13) 

Q 3 - Q 3 + C 2 G t RGC 2 + 0(e) 

(4.14) 


As in Section 2, we could remove the dependence of (4.3) on initial 

conditions by assuming that they are uniformly distributed on the unit 

•**» ' 

sphere. The problem statement is then modified slightly to that of 
minimizing E{J), which amounts to replacing x 0 x 0 in (4.3) by 

the Identity matrix. For the two time scale problem, we Instead assume 

that r? T (0),n T (0)J is uniformly distributed on the unit sphere. This is 

because, under transformation by T(e) at e 3 0, the former assumption 

leads to 


5(0) [? T (0)n T (0)] 

n(0) 


•» T(0)E{x o Xo>T t (0) 

I n t 


N 


I + NN t 


(4.15) 


which is Inconveniently complicated. It should be noted from (4. 3, 4, 5) 
and (3.13) that the difference between the costs resulting from either 
assumption is only 0 (e); further, the results from this section can be 
extended to any assumption on the initial condition. 


The transformed cost for this problem is 

J " tr{Ki> + etr{K 3 > (4.16) 

Now, note that the fact subsystem performance measure is, not unexpected- 
ly, 0(e). At e ■ 0, where we would like to approximate the system dyna- 
mics, there is no cost associated with fast dynamics. On the other hand, 
minimization of tr{Ki(e * 0)} with respect to G° must be done over the 
set of gains which would also stabilize A 22 , subject to (3.11). In order 
to do this in a rational way, we Instead minimize 

jo * tr{K|(e * 0)} + e°tr{K 3 (e * 0)} (4.17) 

where e° is fixed as the value of s in (3.2). In fact, minimizing (4.17) 
allows simultaneous nrar-optimlzation of the slow and fast dynamics for 
essentially the same level of computational effort that would have been 
required to minimize tr{Ki(e “0)} alone, subject to the asymptotic sta- 
bility of the fast subsystem. This situation contrasts dramatically with 
that Wi in the singularly perturbed state feedback optimization 
problem. There, because of the complete decoupling of the slow and fast 
subsystems, the control designer has the option of only calculating gains 
for the slow dynamics, if the fast dynamics are open-loop stable and if 
an 0(e) approximation to optimal system performance is satisfactory. 

Even if the fast dynamics require stabilization, this is done as a task 
totally divorced from the slow subsystem design, and without using 

information about e. Here, in the output feedback problem, the 
# 

constraint (3.9) inseparably links the slow and fast subproblems. 

It is fairly obvious that a gain G minimizing J°, when applied to 
the full-order dynamics (3. 1-3. 4), will provide an 0(e) approximation to 
actual optimal performance. In cases where p{C 0 } “ p, however, it is 
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possible to make a stronger statement about the near-optimality of the 
approximate gain; 

Theorem 4.1 ; Given that p{C) * p, assume that ptC 0 } » p. Let G* 
be such thafc J(G*) < J(G) Tor J given by (4.16) and the dynamics 
(3. 1-3.4). Let G be such that J°(G) < J C (G) for J° given by (4.17) 
and the dynamics (3.17,3.18) at e - 0. Then, 

J(G) « J(G*) + 0(e 2 ) (4.18) 
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